AITopics | model bias

DPA: A One-stop Metric to Measure Bias Amplification in Classification Datasets

Neural Information Processing Systems | Jun-22-2026, 23:07:38 GMT

Most ML datasets today contain biases. When we train models on these datasets, they often not only learn these biases but can worsen them -- a phenomenon known as bias amplification. Several co-occurrence-based metrics have been proposed to measure bias amplification in classification datasets. They measure bias amplification between a protected attribute (e.g., gender) and a task (e.g., cooking). These metrics also support fine-grained bias analysis by identifying the direction in which a model amplifies biases. However, co-occurrence-based metrics have limitations -- some fail to measure bias amplification in balanced datasets, while others fail to measure negative bias amplification.
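The idea of a co-occurrence-based measurement can be sketched as follows. This is a simplified illustration only, not DPA and not any specific published metric: it compares how often a protected attribute co-occurs with a task in the training labels against how often it does so in model predictions. The function names and the toy counts are hypothetical.

```python
from collections import Counter

def attribute_share(pairs, task, attribute):
    """Fraction of examples labelled `task` that also carry `attribute`."""
    counts = Counter(pairs)
    task_total = sum(n for (t, _), n in counts.items() if t == task)
    if task_total == 0:
        raise ValueError(f"no examples for task {task!r}")
    return counts[(task, attribute)] / task_total

def bias_amplification(train_pairs, predicted_pairs, task, attribute):
    """Positive: predictions strengthen the training co-occurrence.
    Negative: predictions weaken it."""
    return (attribute_share(predicted_pairs, task, attribute)
            - attribute_share(train_pairs, task, attribute))

# Hypothetical toy data: (task, protected attribute) pairs.
train = [("cooking", "female")] * 60 + [("cooking", "male")] * 40
predicted = [("cooking", "female")] * 75 + [("cooking", "male")] * 25

amplification = bias_amplification(train, predicted, "cooking", "female")
print(f"amplification for (cooking, female): {amplification:+.2f}")
```

In this toy case the share rises from 0.60 in training to 0.75 in predictions, so the sign of the difference gives the direction of amplification.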

bias amplification, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Unlocker: Disentangle the Deadlock of Learning between Label-noisy and Long-tailed Data

Neural Information Processing Systems | Jun-14-2026, 02:06:16 GMT

In the real world, the observed label distribution of a dataset often mismatches its true distribution due to noisy labels. In this situation, noisy-label learning (NLL) methods directly integrated with long-tail learning (LTL) methods tend to fail due to a dilemma: NLL methods normally rely on unbiased model predictions to recover the true distribution by selecting and correcting noisy labels, while LTL methods like logit adjustment depend on the true distribution to adjust biased predictions, leading to a deadlock of mutual dependency defined in this paper. To address this, we propose Unlocker, a bilevel optimization framework that integrates NLL and LTL methods to iteratively disentangle this deadlock. The inner optimization leverages NLL to train the model, incorporating LTL methods to fairly select and correct noisy labels. The outer optimization adaptively determines an adjustment strength, mitigating model bias from over- or under-adjustment. We also theoretically prove that this bilevel optimization problem is convergent by transferring the outer optimization target to an equivalent problem with a closed-form solution. Extensive experiments on synthetic and real-world datasets demonstrate the effectiveness of our method in alleviating model bias and handling long-tailed noisy-label data.
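As a rough illustration of why logit adjustment depends on the class distribution, the sketch below applies the standard post-hoc form, subtracting `tau * log(prior)` from each logit. It is not the Unlocker bilevel procedure; treating `tau` as the "adjustment strength" is an assumption, and the logits and priors are hypothetical.

```python
import math

def adjust_logits(logits, class_priors, tau=1.0):
    """Post-hoc logit adjustment: subtract tau * log(prior) per class.

    tau = 0 leaves the logits unchanged; larger tau shifts predictions
    further toward rare classes.
    """
    return [z - tau * math.log(p) for z, p in zip(logits, class_priors)]

def predict(logits):
    """Index of the largest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])

# Hypothetical two-class example: class 0 is the head, class 1 the tail.
logits = [2.0, 1.5]
priors = [0.9, 0.1]

raw_pred = predict(logits)
adjusted_pred = predict(adjust_logits(logits, priors, tau=1.0))
print(raw_pred, adjusted_pred)
```

If the priors are estimated from noisy labels, the correction itself is off, which is the mutual dependency the abstract describes.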

artificial intelligence, optimization problem, proceedings, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.59)

UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models

Neural Information Processing Systems | Mar-22-2026, 13:56:51 GMT

Pre-trained vision-language models (e.g., CLIP) have shown powerful zero-shot transfer capabilities. However, they still struggle with domain shifts and typically require labeled data to adapt to downstream tasks, which can be costly. In this work, we aim to leverage unlabeled data that naturally spans multiple domains to enhance the transferability of vision-language models. Under this unsupervised multi-domain setting, we identify inherent model bias within CLIP, notably in its visual and text encoders. Specifically, we observe that CLIP's visual encoder tends to prioritize encoding domain information over discriminative category information, while its text encoder exhibits a preference for domain-relevant classes.

artificial intelligence, machine learning, natural language, (8 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.99)
Information Technology > Artificial Intelligence > Natural Language (0.85)

How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization

Neural Information Processing Systems | Feb-16-2026, 18:49:34 GMT

Designing and deriving effective model-based reinforcement learning (MBRL) algorithms with a performance improvement guarantee is challenging, mainly because of the tight coupling between model learning and policy optimization. Many prior methods that rely on return discrepancy to guide model learning ignore the impact of model shift, which can lead to performance deterioration due to excessive model updates. Other methods use a performance difference bound to explicitly consider model shift. However, these methods rely on a fixed threshold to constrain model shift, resulting in a heavy dependence on the threshold and a lack of adaptability during the training process. In this paper, we theoretically derive an optimization objective that can unify model shift and model bias and then formulate a fine-tuning process. This process adaptively adjusts the model updates to obtain a performance improvement guarantee while avoiding model overfitting.

machine learning, model shift, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.67)

A Proofs: We first redefine notation for clarity and then provide the proofs of the results in the main paper

Neural Information Processing Systems | Feb-13-2026, 21:21:36 GMT

We first redefine notation for clarity and then provide the proofs of the results in the main paper. Now we first prove that the iteration in Eq. 2 has a fixed point. Proof of Lemma 3.1: Let We present the bound on using the empirical Bellman operator compared to the true Bellman operator. The proof can be found in [6]. Proof of Theorem 3.4: Recall that the expression of the V-function iterate is given by: Proof of Theorem 3.6: The proof of this statement is divided into two parts.

bellman operator, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

f106b7f99d2cb30c3db1c3cc0fde9ccb-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-11-2026, 01:42:38 GMT

correlation, dataset, likelihood, (13 more...)

Neural Information Processing Systems

Genre: Research Report (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features

Neural Information Processing Systems | Dec-24-2025, 21:08:25 GMT

Deep generative networks trained via maximum likelihood on a natural image dataset like CIFAR10 often assign high likelihoods to images from datasets with different objects (e.g., SVHN). We refine previous investigations of this failure at anomaly detection for invertible generative networks and provide a clear explanation of it as a combination of model bias and domain prior: convolutional networks learn similar low-level feature distributions when trained on any natural image dataset, and these low-level features dominate the likelihood. Hence, when the discriminative features between inliers and outliers are at a high level, e.g., object shapes, anomaly detection becomes particularly challenging. To remove the negative impact of model bias and domain prior on detecting high-level differences, we propose two methods. First, we use the log-likelihood ratios of two identical models, one trained on the in-distribution data (e.g., CIFAR10) and the other on a more general distribution of images (e.g., 80 Million Tiny Images). We also derive a novel outlier loss for the in-distribution network on samples from the more general distribution to further improve the performance. Second, using a multi-scale model like Glow, we show that low-level features are mainly captured at early scales. Therefore, using only the likelihood contribution of the final scale performs remarkably well for detecting high-level feature differences between the out-of-distribution and in-distribution data. This method is especially useful if one does not have access to a suitable general distribution. Overall, our methods achieve strong anomaly detection performance in the unsupervised setting, and only slightly underperform state-of-the-art classifier-based methods in the supervised setting.
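The log-likelihood-ratio idea can be sketched with toy densities. This is a minimal illustration under strong assumptions: two 1-D Gaussians stand in for the in-distribution and general-distribution models (the abstract uses invertible networks such as Glow), and all names and parameters here are hypothetical.

```python
import math

def gaussian_log_pdf(x, mean, std):
    """Log density of a 1-D Gaussian; a stand-in for a trained generative model."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)

def log_likelihood_ratio(x, in_dist, general):
    """Score = log p_in(x) - log p_general(x); higher means more in-distribution."""
    return gaussian_log_pdf(x, *in_dist) - gaussian_log_pdf(x, *general)

# Hypothetical toy "models" given as (mean, std):
# a narrow in-distribution density and a broad general one.
IN_DIST = (0.0, 1.0)
GENERAL = (0.0, 5.0)

inlier_score = log_likelihood_ratio(0.2, IN_DIST, GENERAL)
outlier_score = log_likelihood_ratio(6.0, IN_DIST, GENERAL)
print(f"inlier: {inlier_score:.2f}, outlier: {outlier_score:.2f}")
```

Dividing by the general model's likelihood cancels whatever both models score highly, so only the evidence specific to the in-distribution data remains in the score.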

b9b4f084b2e6709a2bfad0f601271aec-Paper-Conference.pdf

Neural Information Processing Systems | Oct-9-2025, 05:50:03 GMT

machine learning, model shift, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.67)

6e469fbdc43ade121170f61096f4458b-Supplemental-Conference.pdf

Neural Information Processing Systems | Oct-8-2025, 21:11:26 GMT

machine learning, reinforcement learning, time step, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

field, with a "scientifically rigorous", "fair" and "extensive" evaluation (R1,4,5) and 3 of 4 Rs advocating acceptance

Neural Information Processing Systems | Aug-22-2025, 01:07:09 GMT

We thank the reviewers for their informative feedback, indicating improved results (All), that hypotheses are "intuitive" (see Section 6). Do partially joint models help? Still, it is interesting future work to try a joint network (see Discussion p.8). That shows local low-level features, beyond being correlated with the likelihood, dominate it. Overclaiming wrt MSP-OE (R5): We agree and would modify wording, e.g., to "slightly underperform".

dataset, likelihood, scientifically rigorous, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)
